Method for classifying wine and coffee

ABSTRACT

The present invention relates to a method for classifying beverages of natural origin such as wine or coffee. Classification is carried out by means of NIR spectroscopy and corresponding numerical-mathematical conditioning of the spectral data of the individual beverage samples, wherein respective obtained spectra are then correlated with a predetermined beverage class. By means of the method of the invention it is possible to classify wines by sort of wine, growing regions, grape, vine, vintage, kind of material or wood of the wine cask used, and varying degree of maturity of the wine, and by other chemical parameters. Coffee, for example, may be classified by coffee sort, country of origin, coffee growing region, roasting method and defined chemical parameters, e.g., caffeine content, or chlorogenic acid content.

FIELD OF THE INVENTION

[0001] The present invention relates to a method for classifying beverages of natural origin. The invention in particular serves for the classification of wine and coffee.

BACKGROUND OF THE INVENTION

[0002] The classification of wines, e.g., according to sort of wine, growing region, grape, and vintage, is at present only possible by the sensory subjective way through the excellently trained sense of smell and taste of a wine connoisseur. Besides the inaccuracies inherent in nature such as, e.g., in the differentiation of single vintages of a wine, these sensory qualities are limited to only a relatively small number of persons.

[0003] Accordingly attempts have not been scarce to examine the various wines by scientific methods, for instance the analysis of single chemical parameters such as sugar content, acidity, ethanol content etc., and/or by physico-chemical methods such as optical rotary dispersion, index of refraction etc., and achieve a classification as named at the outset through interpretation of the single data, a group of data, or the entirety of such data.

[0004] Owing to the complex composition of the wine on the one hand, but on the other hand the similarity of single parameters, any attempts at giving a reliable statement with the aid of analytical methods, e.g. about sort of wine, growing region, grape and vintage of a wine in question, have failed up to the present.

[0005] However, for various reasons it is sensible to have at one's disposal a reliable method for classifying wines. For one thing, monitoring trade products in terms of food technology and their conformity with statutory requirements is hereby possible, for it is possible to detect, e.g., whether the criteria of designation of the growing region are complied with, such as, e.g., whether or not an inadmissible blending with another grape/growing location exists. On the other hand, monitoring the production process and maturation in storage in the course of wine production by the wine grower would be conceivable with such a classification system.

[0006] Apart from the above mentioned classical prior art of wine analytics it is moreover known from the graduation thesis entitled, “Anwendung multivariater Methoden und kunstlicher neuronaler Netze zur Klassifizierung von Spirituosen miffels Headspace-GC/MS-Kopplung”, presented by Patrick Kursawe, Chair for Analytical Chemistry of Ruhr-Universitat Bochum 1998, to classify various spirits ranging from grappa to rum, with relative reliability by applying multivariate methods and artificial neuronal networks and principal components analysis.

[0007] With regard to classification of wine by the modern chemometrical methods, Montanarella et al. (Montanarella, T., Bassani, M. R., Broas, O. (1995): Chemometric Classification of Some European Wines Using Pyrolysis Mass Spectrometry, Rapid Comm. Mass Spectrom. 9 (15), 1589-1593] have attempted, by utilizing various multivariate methods as well as backpropagation networks, to classify wines with regard to their country of origin with the aid of pyrolysiso mass spectra. A fine differentiation between different regions did, however, fail.

SUMMARY OF THE INVENTION

[0008] Starting out from the prior art of Montanarella et al. (1995) it is therefore an object of the present invention to furnish a reliable method for classifying beverages of natural origin, in particular wines and coffees according to—besides the visually recognizable color—at least wines and coffees, respectively.

[0009] In particular, the method of the invention for classifying beverages of natural origin includes the following steps:

[0010] a) providing a plurality of beverage classes, with a plurality of calibration beverage samples per class each, having a plurality of known class properties;

[0011] b) irradiating measurement light from a predetermined wavelength range into the beverage samples;

[0012] c) detecting the measurement light passed through, reflected, re-emitted, and/or dispersed from, the beverage samples;

[0013] d) determining the wavelength-dependent ratio of irradiated to detected measurement light (spectrum) for each beverage sample of a class;

[0014] e) performing numerical-mathematical conditioning of the spectral data of the individual beverage samples;

[0015] f) correlating the spectra of a plurality of beverage samples with a predetermined beverage class;

[0016] g) compiling a database from the conditioned spectral data with different beverage classes based on the measured beverage samples of the individual classes for calibration of a class correlation;

[0017] h) repeating at least once the steps b) to e) with at least one beverage sample having at least partly unknown properties; and i) determining the beverage classes to which the unknown beverage sample is to be associated, with the aid of a class correlation of the measured spectra, by using the compiled calibration database of step g).

[0018] In a particularly preferred manner, wine and coffee are used as a beverage.

BRIEF DESCRIPTION OF THE FIGURES

[0019] Further advantages and features of the present invention result from the description of embodiments and by reference to the drawings, wherein:

[0020]FIG. 1 shows a first cluster representation of the wine sorts: Chianti and Lagrein;

[0021]FIG. 2 shows a second cluster representation of the wine sorts: Chianti and Lagrein;

[0022]FIG. 3 shows a first cluster representation of the wine sorts: Chianti, Lagrein and Cabernet;

[0023]FIG. 4 shows a second cluster representation of the wine sorts: Chianti, Lagrein and Cabernet;

[0024]FIG. 5 shows a third cluster representation of the wine sorts: Chianti, Lagrein and Cabernet;

[0025]FIG. 6 shows a cluster representation of the Cabernet vintages 1997 and 1998; and

[0026]FIG. 7 shows a cluster representation of the maturing process.

DETAILED DESCRIPTION

[0027] By the method of the invention it is possible for the first time to classify a wine sample at least with regard to its associated sort of wine (besides the visually detectable color). As a rule, however, even classifications according to grape, growing region and vintage are possible.

[0028] The expression “class” or “wine class” is understood, for the purposes of the present invention, as a group of wines having defined properties, i.e. class properties, such as, e.g., sort of wine, grape, growing region and vintage.

[0029] Thus, for instance, the wine class “Chianti Antinori 1996” may exhibit the class properties: sort of wine: “red wine, Chianti type”, “grape: main constituent: Sangiovese”, growing region: “South Tyrol”, and vintage “1996.”

[0030] By the method of the invention it was possible by way of example to achieve an unambiguous classification with unknown wine samples for the following wines:

[0031] Pure wines: Lagrein and Cabernet: 1997 and 1998 vintages, Laimburg Wine Research Center (Weinforschungszentrum Laimburg); and

[0032] Chianti Classico Villa Antinori: 1996 vintage, growing regions Tyrol and South Tyrol, grape: main constituent: Sangiovese.

[0033] Moreover it was surprisingly found that the method of the invention is also particularly well suited for the classification of coffee in accordance with the classes: coffee sort; country of origin, coffee growing region; roasting method; chemical parameters, in particular caffeine content, bittering content, acidity, in particular content of chlorogenic acids, toxicological parameters, in particular content of herbicides and pesticides.

[0034] In accordance with the present invention it is preferred to record an NIR spectrum of the wines in question without any further preparation of samples.

[0035] For this purpose, e.g., a commercially available NIR-VIS spectrometer may be used. Numerical-mathematical conditioning of the spectral raw data may be carried out with an equally commercially available software, e.g. BCAP V 6.00 [BOHLER AG, ANATEC, CH-9240 Uzwil, Switzerland].

[0036] Class correlation may also be performed with a commercially available software, such as Nircal 3.0 [Buchi AG, CH-9230 Flawill], e.g., through principal components analysis and clustering. The result may be represented, for example, in the form of a cluster representation as a 3-D plot, wherein the axes represent the principal components.

[0037] In order to calibrate the method of the invention, initially of a plurality of wine samples known with regard to sort of wine, grape or grapes, growing region and vintage (as a rule at least 10 samples/class property) one respective NIR spectrum is measured, as a rule repeatedly, in order to buffer statistical variations. This data is as a rule conditioned numerically-mathematically in order to reduce the bulk of data and concentrate on the essential characteristics of the spectra.

[0038] Then the method is correlated with these samples such that multivariate methods like principal components analysis, clustering, artificial neuronal networks, are applied to this conditioned data, to be able to state based on the abundant data whether or not an unknown wine sample, when also measured by NIR spectroscopy, belongs to this class.

[0039] Multivariate methods refer to evaluation methods utilizing more than just one measurement signal of a same sample in order to arrive at an analysis result. Among these methods there are i.a. Multi-linear Regression (MLR), Principal Components Analysis (PCA), Principal Components Regression (PCR), the method of Partial Least Squares (PLS), clustering methods, and artificial neuronal networks.

[0040] For the artificial neuronal networks in particular the following algorithms may be considered: backpropagation networks, Dynamic Learning Vector Quantization (DLVQ algorithm), Radial Basis Functions (RBF networks), in particular RBF networks (RBF-DDA networks) trained with the Dynamic Decay Adjustment algorithm (DDA algorithm).

[0041] PCA performs a separation of the original data matrix into two matrices, referred to as factor values and loadings. In the original data space a vector is selected in such a way that a maximum possible part of the variance is imaged when projecting the data onto it. This vector is the first principal component. A second principal component is orthogonal to the first principal component, and optionally a third principal component is orthogonal to the first and second principal components, wherein the second and third principal components are to image as much as possible of the variance not described yet by the first and second principal components, respectively.

[0042] The coordinates along the first principal component contain the essential information of the data, with the second and third principal components essentially reflecting scattering.

[0043] This process is repeated until either the number of the principal components corresponds to that of the dimension of the starting data, or until a particular termination criterion is reached.

[0044] The principal components thus obtained are linear combinations of the original dimensions. They are linearly independent of each other, so that a defined number of principal components contain less redundant information than the same number of starting variables.

[0045] Moreover the thus-obtained principal components each describe a maximum possible variance of the starting data not described yet by the already existing principal components. As a result, generally the first three to five principal components reflect the essential proportion of the information in the set of data.

[0046] Mathematically speaking, principal components analysis is a characteristic value problem, the fundamental solution of which is known to the person having skill in the art.

[0047] The result of principal components analysis also is a transformation of the N-dimensional original data space, with the result that the first dimensions contain the essential data portions strongly contributing to the overall variance, and the last dimensions basically reflecting no more than the noise content. In this way the structure of the spectroscopic data in question may be represented by plotting the first principal components relative to each other. As a two-dimensional, preferably 3-D, image they are then available for visual evaluation to the user who is left with the option of selecting a representation that enables a classification of wine samples into particular classes and which may, of course, also be automated.

[0048] In calibration, the so-called tolerance circles of the images may then be selected so as to be adaptable to particular classes, where necessary, in order to facilitate classification.

[0049] Preferably about 70% of the totality of wine samples measured per class are used for calibration, and about 30% for validation of the method of the invention.

[0050] For better reproducibility of the method of the invention, the samples are measured at a constant temperature, preferably at approximately 23° C.

[0051] The method of the invention allows classification of wine samples with the following class properties, wherein the following group is encompassed at least in part: sort of wine; growing region; grape; vine; vintage; kind of material, in particular species of wood of the wine cask used for storage/maturing, preferably kind of oak, e.g., American oak, French oak or also Hungarian oak; varying maturity degree of storage in the cask; chemical parameters, in particular ethanol content, sugar content, acidity, 802 content; tannin content; pH; water content; dry residue; polyphenol content; toxicological parameters, in particular glycol content and/or methanol content.

[0052] Further preferred embodiments are within the scope of the present invention.

[0053] In FIGS. 1 to 6, cluster representations for different wines and vintages are shown.

[0054] Initially NIR spectra of various wines were established. These spectra were processed in 20-ml measuring cuvettes in the absence of any further sample preparation, with an NIR-VIS spectrometer (FT-IR universal spectrometer) and with the BCAP V6.0 software (BOHLER Analytical Package, BOHLER AG, Anatec; CH-9240 Uzwil, Switzerland).

[0055] Classification through principal components analysis/clustering was performed with the aid of the NIRCAL 3.0 software (BUHLER AG, Anatec; CH-9240 Uzwil, Switzerland). This was a software for controlling the NIR-VIS spectrometer and chemometrical evaluation of the recorded spectra.

[0056] The optical layer thicknesses for measurement of the spectra in the examples were 0.5 mm or 3 mm.

[0057] All of the samples were measured at a constant thermostated temperature of approx. 23° C.

[0058] The exemplarily examined wines were distributed among three different wine classes:

[0059] 1. Pure wines, i.e., wines a 100% produced from a defined grape and originating from a single growing region (relatively small in the exemplary case). In this wine class, wines of the grape “Lagrein” of Laimburg Wine Research Center, 1997 and 1998 vintages, were used.

[0060] 2. Pure wines, i.e., wines a 100% produced from a single grape and moreover originating from a small growing region. In this wine class, wines of the grape “Cabernet”, equally from Laimburg Wine Research Center, 1997 and 1998 vintages, were used.

[0061] 3. Wines originating from a wide growing region and for whose production a plurality of vines were used. In this wine class a “Chianti Antinori”, 1996 vintage, was used, purchased at various retailers in Tyrol and South Tyrol (main constituent is the “Sangiovese” grape).

[0062] 4. As a quality control for the method of the invention, Majorcan wines were used.

[0063] For calibration of the individual class properties, in the exemplary case at least 15 samples each were employed. In the case of calibration for a vintage, 10 samples were used. The number of scans per spectrum and sample was between 3 and 20.

[0064] In the 3-D plot of the three principal components (cluster representations), it was possible to represent the wine classes of Lagrein, Sangiovese (Chianti) and Cabernet. Unknown samples could be classified accurately with the aid of the method of the invention.

[0065] Moreover with the method of the invention it is possible to discriminate between the 1997 and 1998 vintages in the example of a Cabernet wine, and in unknown samples to state reliably whether and to which one of the exemplarily named vintages they are to be assigned.

[0066] In the following, the parameters are indicated whereby the single 3-D plots of FIGS. 1 to 6 were recorded:

[0067]FIG. 1 shows a first cluster representation of the wine sorts: Chianti and Lagrein, measured and evaluated with the following parameters: Calibration protocol Layer thickness: 0.5 mm Software NIRCAL V3.04 (Build 216) Classes used in calibration set Chianti, Lagrein (total 2/2) Total number of spectra 1-81 (total 81/81) No. of calibration spectra 51/81 No. of validation spectra total 27/81 Wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range 5628-7404 [1/cm] No. of arithmetic operations for 2 preliminary data processing Preliminary data processing 1. Normalization between 0 to 1*, sequence 5628-7404 2. Second Derivative Taylor 3 Points Chemometrical method Cluster No. of primary factors 5 No. of calibration factors 1-5 (total 5/5)

[0068]FIG. 2 shows a second cluster representation of the wine sorts: Chianti and Lagrein, measured and evaluated with the following parameters: Calibration protocol Layer thickness: 3 mm Properties in project Lagrein, Chianti (total 2/2) Classes used in the calibration set Lagrein, Chianti (total 2/2) Total number of spectra 1-91 (total 91/91) No. of calibration spectra total 51/91 No. of validation spectra total 40/91 Calibration wavelength range 4692-9960 [1/cm] No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Normalization between 0 to 1*, processing 4692-9960 2. Second Derivative Taylor 3 Points Cluster Chemometrical method Cluster No. of primary factors 4 No. of calibration factors 1-3 (total 3/4)

[0069]FIG. 3 shows a first cluster representation of the wine sorts: Chianti, Lagrein and Cabernet, measured and evaluated with the following parameters: Calibration Protocol Layer thickness: 3 mm Total number of spectra 1-161 (total 161/161) No. of calibration spectra total 116/161 No. of validation spectra total 45/161 Wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range 4428-9900 [1/cm] No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Normalization by Maxima*, processing 4428-9900 (total 457/500) 2. Second Derivative Taylor 3 Points Chemometrical method Cluster No. of primary factors 6 No. of calibration factors 1-5 (total 5/6)

[0070]FIG. 4 shows a second cluster representation of the wine sorts: Chianti, Lagrein and Cabernet, measured and evaluated with the following parameters: Calibration Protocol Layer thickness: 3 mm Classes used in the calibration set Lagrein, Cabernet, Chianti (total 3/3) Total number of spectra 1-157 (total 157/157) No. of calibration spectra total 116/157 No. of validation spectra total 41/157 Wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range 4428-9900 [1/cm] No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Smooth Average 3 Points processing 2. Second Derivative Taylor 3 Points Chemometrical method Cluster No. of primary factors 4 No. of calibration factors 1-3 (total 3/4)

[0071]FIG. 5 shows a third cluster representation of the wine sorts: Chianti, Lagrein and Cabernet, measured and evaluated with the following parameters: Calibration Protocol Layer thickness: 3 mm Classes used in the calibration set Lagrein, Cabernet, Chianti (total 3/3) No. of calibration spectra total 119/167 No. of validation spectra total 48/167 Spectra unused (U-Set) nothing selected (total 0/167) Wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range 4428-9900 [1/cm] No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Smooth Average 3 Points processing 2. Second Derivative Taylor 3 Points Chemometrical method Cluster No. of primary factors 5 No. of calibration factors 1-5 (total 5/5)

[0072]FIG. 6 shows a cluster representation of Cabernet vintages 1997 and 1998, measured and evaluated with the following parameters: Calibration Protocol Layer thickness: 3 mm Classes used in the calibration set 97, 98 (total 2/2) Total number of spectra 1-60 (total 60/60) No. of calibration spectra total 45/60 No. of validation spectra total 15/60 Wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range [1/cm] 4512-9996 No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Smooth Average 3 Points processing 2. Second Derivative Taylor 3 Points Chemometrical method Cluster No. of primary factors 3 No. of calibration factors 1-3 (total 3/3)

[0073]FIG. 7 shows a cluster formation of the maturing process of wines of the Tempranillo and Cabernet Sauvignon grapes, measured and evaluated with the following parameters: Calibration Protocol Layer thickness: 3 mm Classes used in the calibration set 12.04., 3.7., 22.8. and 18.10.2000 (total 4/7) Total number of spectra Total 213/318 No. of calibration spectra total 105/318 No. of validation spectra total 15/60 wavelength range [1/cm] 4008-9996 (total 500/500) Calibration wavelength range [1/cm] 4512-9996 No. of arithmetic operations for 2 preliminary data processing Sequence of preliminary data 1. Normalization between 0 to 1*, processing 4500-9996 (total 459/500) 2. First Derivation Derivative Taylor 3 Punkte Chemometrical method Cluster No. of primary factors 8 No. of calibration factors 1-4 (total 4/8)

[0074] The parameters listed in the above tables—where not self-explanatory—moreover have the following meanings:

[0075] Calibration protocol: Layer thickness 0.5 mm: The optical layer thickness used for calibration is 0.5 mm.

[0076] Number of arithmetic operations for preliminary data processing: This is the number of mathematical arithmetic operations for preliminary processing of the spectra. 

1. A method for classifying beverages of natural origin, including the following steps: a) providing a plurality of beverage classes, with a plurality of calibration beverage samples per class, each having a plurality of known class properties; b) irradiating measurement light from a predetermined wavelength range into the beverage samples; c) detecting the measurement light passed through, reflected, re-emitted, and/or dispersed from, the beverage samples; d) determining the wavelength-dependent ratio of irradiated to detected measurement light (spectrum) for each beverage sample of a class; e) performing numerical-mathematical conditioning of the spectral data of the individual beverage samples; f) correlating the spectra of a plurality of beverage samples with a predetermined beverage class; g) compiling a database from the conditioned spectral data with different beverage classes based on the measured beverage samples of the indivi9ualclasses for calibration of a class correlation; h) repeating at least once the steps b) to e) with at least one beverage sample having at least partly unknown properties; and i) determining the beverage classes to which the unknown beverage sample is to be associated, with the aid of a class correlation of the measured spectra, by using the compiled calibration database of step g); wherein correlation of the numerically-mathematically conditioned spectral data is performed by cluster formation.
 2. The method according to claim 1, wherein said beverages are wines, and wherein the class properties of the individual wine classes comprise properties selected from the group consisting of: type of wine; growing region; grape; vine; vintage; kind of material, including species of wood of the wine cask used for storage/maturing, including American oak, French oak, Hungarian oak, and mixed forms of woods; degree of maturity of storage in cask; chemical parameters, including ethanol content, sugar content, acidity, and 802 content; tannin content; pH value; water content; dry residue; polyphenol content; and toxicological parameters, including glycol content and methanol content.
 3. The method according to claim 1, wherein said beverages are coffees, and wherein the class properties of the individual coffee classes comprise properties selected from the group consisting of: coffee sort; country of origin; coffee growing region; roasting method; and chemical parameters, including caffeine content, bittering content, acidity, and chlorogenic acids content.
 4. The method according to claim 1, wherein the wavelength range is in the range of approximately 700 to 2,200 nm.
 5. The method according to claim 1, wherein said measurement light is introduced through the beverage samples and/or received from them with the aid of a light waveguide.
 6. The method according to claim 1, wherein the optical layer thickness of the sample is between about 0.2 and 5 mm.
 7. The method according to claim 1, wherein the beverage samples are thermostated for measurement.
 8. The method according to claim 1, wherein said numerical-mathematical conditioning encompasses at least one data reduction selected from the group consisting of normalization, smoothing, 1st derivation, 2nd derivation, multiplicative scatter correction, reciprocal value, square, mean centering, Kubelka Munc transformation, absorption, baseline correction, addition of a constant, and shift negative to zero.
 9. The method according to claim 1, wherein said correlation of numerically-mathematically conditioned spectral data encompasses at least one multivariate method.
 10. The method according to claim 1, wherein the tolerance circles of the individual clusters are adjustable in calibration.
 11. The method according to claim 1, wherein approximately 3 to 20 spectral scans/beverage sample are recorded.
 12. The method according to claim 1, wherein at least approx. 10 beverage samples per class property are used for calibration.
 13. The method according to claim 1, wherein about 70% of all beverage samples measured per class are used for calibration, and about 30% for validation of the method.
 14. The method according to claim 1, wherein the classification result is represented on a screen as a 3-D plot.
 15. The method according to claim 4, wherein the wavelength range is in the range of approximately 1,000 to 2,200 nm.
 16. The method according to claim 6, wherein the optical layer thickness of the sample is between about 0.5 mm and about 3 mm.
 17. The method according to claim 7, wherein the beverage samples are thermostated for measurement at about 23° C.
 18. The method according to claim 9, said at least one multivariate method is selected from the group consisting of a principal components analysis (PCA), a smoothing, a series development, a Taylor series development, an artificial neuronal network algorithm, a backpropagation network, a dynamic learning vector quantization (DLVQ algorithm), a radial basis function (RBF networks), and a RBF networks (RBF-DDA network) trained with a dynamic decay adjustment algorithm (DDA algorithm). 